Eclectic English Vocab
404wolf.com·9h
🔄Incremental Lexing
Claude Sonnet vs GLM 4.6: A Token Efficiency Comparison
reddit.com·1d·
Discuss: r/ClaudeAI
🚀Tokenizer Performance
Appen Finds LLMs Struggle with Idioms and Culture in Multilingual AI Translations
slator.com·1h
🎮Language Ergonomics
TypeNet Benchmark for development of authentication keystroke technologies
github.com·1d·
Discuss: Hacker News
🌱Minimal ML
Lazy text capitalization with low latency large language models
blog.florianschulz.info·1d·
Discuss: Hacker News
🔄Incremental Lexing
Why LLMs Hallucinate on Emojis (And 4 Tokens That Break Production AI)
dev.to·4h·
Discuss: DEV
🌊Gradual Effects
Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models
arxiv.org·6h
🪜Recursive Descent
What Makes a Language Look Like Itself?
towardsdatascience.com·3d
🔤Language Tokenizers
BULaMU-The First Luganda Large Language Model Trained from Scratch
reddit.com·1d·
Discuss: r/LocalLLaMA
🌱Minimal ML
Beyond the Prompt: A Developer's Playbook for Ethically Scaling B2B Content with GenAI
getmichaelai.com·23h·
Discuss: DEV
🎮Language Ergonomics
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
magazine.sebastianraschka.com·23h·
Discuss: Hacker News
🌱Minimal ML
DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding
arxiv.org·6h
🚀Tokenizer Performance
Fun with HyperLogLog and SIMD
vaktibabat.github.io·2d·
🔢Bit Manipulation
LLM-Based Instance-Driven Heuristic Bias in the Context of a BRKGA
researchgate.net·1d·
Discuss: Hacker News
🪜Recursive Descent
Constraint Satisfaction Approaches to Wordle: Novel Heuristics and Cross-Lexicon Validation
arxiv.org·6h
🧩Constraint Solvers
Technical Explanations Why LLMs Use Em Dashes
msukhareva.substack.com·20h·
Discuss: Substack
🚀Tokenizer Performance
Eliminating the Precision–Latency Trade-Off in Large-Scale RAG
thenewstack.io·2d
🔍Text Indexing
Opti's Claude 4.5 Sonnet "vibe coding" report
stacker.news·22h
🔬Nanopasses
Optimize efficiency with language analyzers using scalable multilingual search in Amazon OpenSearch Service
aws.amazon.com·3d
🔤Language Tokenizers